Voice activity detection based on conditional MAP criterion incorporating the spectral gradient
نویسندگان
چکیده
In this paper, we propose a novel approach to improve a statistical model-based voice activity detection (VAD) method based on a modified conditional maximum a posteriori (MAP) criterion incorporating the spectral gradient scheme. The proposed conditional MAP incorporates not only the voice activity decision in the previous frame as in [1] but also the spectral gradient of the observed spectra between the current frame and the past frames to efficiently exploit the inter-frame correlation of voice activity. As a result, the proposed VAD leads to six separate thresholds to be adaptively determined in the likelihood ratio test (LRT) depending on both the previous VAD result and the estimated spectral gradient parameter. Experimental results demonstrate that the proposed approach yields better results compared to those of the previous conditional MAPbased method. & 2012 Elsevier B.V. All rights reserved.
منابع مشابه
Toward detecting voice activity employing soft decision in second-order conditional MAP
In this paper, we propose a novel approach to statistical modelbased voice activity detection (VAD) that incorporates a secondorder conditional maximum a posteriori (MAP) criterion. As a technical improvement for the first-order conditional MAP criterion in [1], we consider both the current observation and the voice activity decision in the previous two frames to take full consideration of the ...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملVoice activity detection based on a family of parametric distributions
In this letter, generalized gamma distribution (GCD) is introduced as a new statistical model of spectral distribution to be applied to the likelihood ratio test performed in voice activity detection (VAD). A gradient-based on-line algorithm is proposed to estimate the parameters of GCD according to the maximum likelihood criterion. Experimental results show that the VAD algorithm implemented b...
متن کاملStatistical Model-Based Voice Activity Detection Based on Second-Order Conditional MAP with Soft Decision
© 2012 ETRI Journal, Volume 34, Number 2, April 2012 In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (CMAP) criterion. As a technical improvement for the first-order CMAP criterion in [1], we consider both the current observation and the voice activity decision in the previous ...
متن کاملJoint spectral distribution modeling using restricted boltzmann machines for voice conversion
This paper presents a new spectral modeling and conversion method for voice conversion. In contrast to the conventional Gaussian mixture model (GMM) based methods, we use restricted Boltzmann machines (RBMs) as probability density models to model the joint distributions of source and target spectral features. The Gaussian distribution in each mixture of GMM is replaced by an RBM, which can bett...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Signal Processing
دوره 92 شماره
صفحات -
تاریخ انتشار 2012